Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 18448 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 3 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 3.5 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Numeric | 23 |
|---|---|
| Categorical | 2 |
Alerts
Dataset has 3 (< 0.1%) duplicate rows | Duplicates
price is highly correlated with num_bath and 2 other fields | High correlation |
num_bed is highly correlated with num_bath and 1 other field | High correlation
num_bath is highly correlated with price and 5 other fields | High correlation |
size_house is highly correlated with price and 3 other fields | High correlation |
size_lot is highly correlated with avg_size_neighbor_lot | High correlation |
num_floors is highly correlated with num_bath and 1 other field | High correlation
year_built is highly correlated with num_bath and 1 other field | High correlation
zip is highly correlated with longitude and 9 other fields | High correlation |
latitude is highly correlated with warehousedist and 5 other fields | High correlation |
longitude is highly correlated with zip and 9 other fields | High correlation |
avg_size_neighbor_houses is highly correlated with price and 2 other fields | High correlation |
avg_size_neighbor_lot is highly correlated with size_lot | High correlation |
schooldist is highly correlated with zip and 9 other fields | High correlation |
supermarketdist is highly correlated with zip and 9 other fields | High correlation |
warehousedist is highly correlated with zip and 10 other fields | High correlation |
churchdist is highly correlated with zip and 10 other fields | High correlation |
collegedist is highly correlated with zip and 10 other fields | High correlation |
hospitaldist is highly correlated with zip and 10 other fields | High correlation |
train_stationdist is highly correlated with zip and 10 other fields | High correlation |
universitydist is highly correlated with zip and 10 other fields | High correlation |
hangardist is highly correlated with zip and 9 other fields | High correlation |
price is highly correlated with num_bath and 2 other fields | High correlation |
num_bed is highly correlated with num_bath and 1 other field | High correlation
num_bath is highly correlated with price and 5 other fields | High correlation |
size_house is highly correlated with price and 3 other fields | High correlation |
size_lot is highly correlated with avg_size_neighbor_lot | High correlation |
num_floors is highly correlated with num_bath | High correlation |
year_built is highly correlated with num_bath | High correlation |
zip is highly correlated with longitude and 8 other fields | High correlation |
latitude is highly correlated with schooldist and 7 other fields | High correlation |
longitude is highly correlated with zip and 9 other fields | High correlation |
avg_size_neighbor_houses is highly correlated with price and 2 other fields | High correlation |
avg_size_neighbor_lot is highly correlated with size_lot | High correlation |
schooldist is highly correlated with zip and 10 other fields | High correlation |
supermarketdist is highly correlated with zip and 9 other fields | High correlation |
warehousedist is highly correlated with zip and 10 other fields | High correlation |
churchdist is highly correlated with zip and 10 other fields | High correlation |
collegedist is highly correlated with zip and 10 other fields | High correlation |
hospitaldist is highly correlated with zip and 10 other fields | High correlation |
train_stationdist is highly correlated with zip and 10 other fields | High correlation |
universitydist is highly correlated with zip and 10 other fields | High correlation |
hangardist is highly correlated with latitude and 9 other fields | High correlation |
num_bed is highly correlated with size_house | High correlation |
num_bath is highly correlated with size_house | High correlation |
size_house is highly correlated with num_bed and 2 other fields | High correlation |
size_lot is highly correlated with avg_size_neighbor_lot | High correlation |
latitude is highly correlated with collegedist | High correlation |
longitude is highly correlated with schooldist and 4 other fields | High correlation |
avg_size_neighbor_houses is highly correlated with size_house | High correlation |
avg_size_neighbor_lot is highly correlated with size_lot | High correlation |
schooldist is highly correlated with longitude and 8 other fields | High correlation |
supermarketdist is highly correlated with longitude and 8 other fields | High correlation |
warehousedist is highly correlated with longitude and 8 other fields | High correlation |
churchdist is highly correlated with longitude and 8 other fields | High correlation |
collegedist is highly correlated with latitude and 8 other fields | High correlation |
hospitaldist is highly correlated with longitude and 8 other fields | High correlation |
train_stationdist is highly correlated with schooldist and 7 other fields | High correlation |
universitydist is highly correlated with schooldist and 7 other fields | High correlation |
hangardist is highly correlated with schooldist and 7 other fields | High correlation |
price is highly correlated with num_bath and 3 other fields | High correlation |
num_bed is highly correlated with num_bath and 1 other field | High correlation
num_bath is highly correlated with price and 5 other fields | High correlation |
size_house is highly correlated with price and 4 other fields | High correlation |
size_lot is highly correlated with avg_size_neighbor_lot and 1 other field | High correlation
num_floors is highly correlated with year_built | High correlation |
condition is highly correlated with year_built | High correlation |
size_basement is highly correlated with price and 2 other fields | High correlation |
year_built is highly correlated with num_bath and 8 other fields | High correlation |
zip is highly correlated with year_built and 11 other fields | High correlation |
latitude is highly correlated with zip and 9 other fields | High correlation |
longitude is highly correlated with year_built and 10 other fields | High correlation |
avg_size_neighbor_houses is highly correlated with price and 2 other fields | High correlation |
avg_size_neighbor_lot is highly correlated with size_lot | High correlation |
schooldist is highly correlated with zip and 10 other fields | High correlation |
supermarketdist is highly correlated with zip and 10 other fields | High correlation |
warehousedist is highly correlated with zip and 10 other fields | High correlation |
churchdist is highly correlated with zip and 10 other fields | High correlation |
collegedist is highly correlated with year_built and 11 other fields | High correlation |
hospitaldist is highly correlated with size_lot and 12 other fields | High correlation |
train_stationdist is highly correlated with year_built and 11 other fields | High correlation |
universitydist is highly correlated with year_built and 11 other fields | High correlation |
hangardist is highly correlated with zip and 10 other fields | High correlation |
size_basement has 11174 (60.6%) zeros | Zeros |
renovation_date has 17661 (95.7%) zeros | Zeros |
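The High correlation alerts above flag pairs of fields whose correlation coefficient is large in absolute value; the list repeats in several blocks, presumably one per correlation measure. The following is a minimal sketch of that kind of check using plain Pearson correlation. The function names, the toy data, and the threshold of 0.9 are illustrative assumptions, not values taken from this report or from the profiling tool's configuration.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (norm_x * norm_y)


def high_correlation_pairs(columns, threshold):
    """Return (field_a, field_b, r) for every pair with |r| >= threshold."""
    flagged = []
    for a, b in combinations(columns, 2):
        r = pearson(columns[a], columns[b])
        if abs(r) >= threshold:
            flagged.append((a, b, round(r, 3)))
    return flagged


# Toy values invented for illustration only (not rows from the dataset).
toy = {
    "size_house": [1000, 1500, 2000, 2500, 3000],
    "num_bath": [1.0, 1.5, 2.0, 2.5, 3.5],
    "condition": [3, 5, 3, 4, 3],
}

# Hypothetical threshold; the tool's actual setting is not stated in this report.
result = high_correlation_pairs(toy, threshold=0.9)
print(result)
```

Only the size_house / num_bath pair exceeds the threshold in this toy data; the other two pairs are weakly and negatively correlated.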
Reproduction
| Analysis started | 2022-12-01 21:24:55.585522 |
|---|---|
| Analysis finished | 2022-12-01 21:25:59.943220 |
| Duration | 1 minute and 4.36 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
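A report like this one can be regenerated with the pandas-profiling package listed above. The sketch below assumes the data is available as a CSV file; the function name `build_report`, the file names, and the report title are hypothetical, since the original input path is not recorded here.

```python
def build_report(csv_path, html_path):
    """Profile a CSV file and write the resulting HTML report."""
    # Imported inside the function so the sketch can be loaded
    # even where pandas-profiling is not installed.
    import pandas as pd
    from pandas_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title="Dataset profile")  # title is an assumption
    profile.to_file(html_path)
    return profile


# Example call with hypothetical file names:
# build_report("houses.csv", "houses_profile.html")
```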
price
Real number (ℝ≥0)
| Distinct | 3670 |
|---|---|
| Distinct (%) | 19.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 542362.3713 |
| Minimum | 78000 |
|---|---|
| Maximum | 7700000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 78000 |
|---|---|
| 5-th percentile | 210000 |
| Q1 | 321837.5 |
| median | 450000 |
| Q3 | 648000 |
| 95-th percentile | 1180000 |
| Maximum | 7700000 |
| Range | 7622000 |
| Interquartile range (IQR) | 326162.5 |
Descriptive statistics
| Standard deviation | 372013.519 |
|---|---|
| Coefficient of variation (CV) | 0.6859132173 |
| Kurtosis | 36.39093848 |
| Mean | 542362.3713 |
| Median Absolute Deviation (MAD) | 150000 |
| Skewness | 4.115766647 |
| Sum | 1.000550103 × 10^10 |
| Variance | 1.383940583 × 10^11 |
| Monotonicity | Not monotonic |
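Several of the statistics above are derived from others in the same tables. As a quick consistency check, the sketch below recomputes them from the reported minimum, maximum, quartiles, mean, standard deviation, and the 18448 observations; the variable names are illustrative.

```python
# Values copied from the tables above.
n = 18448
minimum, maximum = 78000, 7700000
q1, q3 = 321837.5, 648000
mean, std = 542362.3713, 372013.519

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance
total = mean * n                 # Sum

print(value_range, iqr, cv, variance, total)
```

The recomputed values agree with the reported Range, IQR, CV, Variance, and Sum up to the rounding of the inputs.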
| Value | Count | Frequency (%) |
| 350000 | 145 | 0.8% |
| 450000 | 142 | 0.8% |
| 550000 | 134 | 0.7% |
| 500000 | 128 | 0.7% |
| 425000 | 127 | 0.7% |
| 400000 | 121 | 0.7% |
| 325000 | 120 | 0.7% |
| 300000 | 119 | 0.6% |
| 375000 | 115 | 0.6% |
| 525000 | 113 | 0.6% |
| Other values (3660) | 17184 | 93.1% |
| Value | Count | Frequency (%) |
| 78000 | 1 | < 0.1% |
| 80000 | 1 | < 0.1% |
| 81000 | 1 | < 0.1% |
| 82500 | 1 | < 0.1% |
| 83000 | 1 | < 0.1% |
| 84000 | 1 | < 0.1% |
| 85000 | 1 | < 0.1% |
| 86500 | 1 | < 0.1% |
| 89000 | 1 | < 0.1% |
| 89950 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7700000 | 1 | < 0.1% |
| 7062500 | 1 | < 0.1% |
| 6885000 | 1 | < 0.1% |
| 5570000 | 1 | < 0.1% |
| 5350000 | 1 | < 0.1% |
| 5300000 | 1 | < 0.1% |
| 5110800 | 1 | < 0.1% |
| 4668000 | 1 | < 0.1% |
| 4500000 | 1 | < 0.1% |
| 4489000 | 1 | < 0.1% |
num_bed
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.372614918 |
| Minimum | 0 |
|---|---|
| Maximum | 33 |
| Zeros | 12 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 33 |
| Range | 33 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9338924435 |
|---|---|
| Coefficient of variation (CV) | 0.2769045581 |
| Kurtosis | 56.20934093 |
| Mean | 3.372614918 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.216603139 |
| Sum | 62218 |
| Variance | 0.872155096 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 8403 | 45.5% |
| 4 | 5863 | 31.8% |
| 2 | 2358 | 12.8% |
| 5 | 1361 | 7.4% |
| 6 | 238 | 1.3% |
| 1 | 157 | 0.9% |
| 7 | 34 | 0.2% |
| 0 | 12 | 0.1% |
| 8 | 12 | 0.1% |
| 9 | 6 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
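Each Frequency (%) value is the row's count divided by the 18448 observations. The sketch below reproduces the column for the counts in this table. The display rule (round to one decimal, show "< 0.1%" when that rounds to zero) is an assumption inferred from the values printed in this report, and `frequency_label` is a hypothetical helper name.

```python
n_observations = 18448

# Counts copied from the table above (value -> count).
counts = {3: 8403, 4: 5863, 2: 2358, 5: 1361, 6: 238,
          1: 157, 7: 34, 0: 12, 8: 12, 9: 6}


def frequency_label(count, total):
    """Format count/total as a percentage the way the tables appear to."""
    pct = round(100 * count / total, 1)
    return "< 0.1%" if pct < 0.1 else f"{pct:.1f}%"


for value, count in counts.items():
    print(value, count, frequency_label(count, n_observations))
```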
| Value | Count | Frequency (%) |
| 0 | 12 | 0.1% |
| 1 | 157 | 0.9% |
| 2 | 2358 | 12.8% |
| 3 | 8403 | 45.5% |
| 4 | 5863 | 31.8% |
| 5 | 1361 | 7.4% |
| 6 | 238 | 1.3% |
| 7 | 34 | 0.2% |
| 8 | 12 | 0.1% |
| 9 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 10 | 3 | < 0.1% |
| 9 | 6 | < 0.1% |
| 8 | 12 | 0.1% |
| 7 | 34 | 0.2% |
| 6 | 238 | 1.3% |
| 5 | 1361 | 7.4% |
| 4 | 5863 | 31.8% |
| 3 | 8403 | 45.5% |
| 2 | 2358 | 12.8% |
num_bath
Real number (ℝ≥0)
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.118888226 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 7 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.75 |
| median | 2.25 |
| Q3 | 2.5 |
| 95-th percentile | 3.5 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0.75 |
Descriptive statistics
| Standard deviation | 0.7723841466 |
|---|---|
| Coefficient of variation (CV) | 0.3645233085 |
| Kurtosis | 1.443049749 |
| Mean | 2.118888226 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.5379165743 |
| Sum | 39089.25 |
| Variance | 0.5965772699 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.5 | 4607 | 25.0% |
| 1 | 3278 | 17.8% |
| 1.75 | 2576 | 14.0% |
| 2.25 | 1764 | 9.6% |
| 2 | 1642 | 8.9% |
| 1.5 | 1232 | 6.7% |
| 2.75 | 1012 | 5.5% |
| 3 | 648 | 3.5% |
| 3.5 | 618 | 3.3% |
| 3.25 | 514 | 2.8% |
| Other values (20) | 557 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 7 | < 0.1% |
| 0.5 | 3 | < 0.1% |
| 0.75 | 57 | 0.3% |
| 1 | 3278 | 17.8% |
| 1.25 | 6 | < 0.1% |
| 1.5 | 1232 | 6.7% |
| 1.75 | 2576 | 14.0% |
| 2 | 1642 | 8.9% |
| 2.25 | 1764 | 9.6% |
| 2.5 | 4607 | 25.0% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 7.75 | 1 | < 0.1% |
| 7.5 | 1 | < 0.1% |
| 6.75 | 2 | < 0.1% |
| 6.5 | 2 | < 0.1% |
| 6.25 | 2 | < 0.1% |
| 6 | 6 | < 0.1% |
| 5.75 | 4 | < 0.1% |
| 5.5 | 10 | 0.1% |
| 5.25 | 13 | 0.1% |
size_house
Real number (ℝ≥0)
| Distinct | 956 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2083.940915 |
| Minimum | 290 |
|---|---|
| Maximum | 13540 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 290 |
|---|---|
| 5-th percentile | 940 |
| Q1 | 1430 |
| median | 1920 |
| Q3 | 2560 |
| 95-th percentile | 3770 |
| Maximum | 13540 |
| Range | 13250 |
| Interquartile range (IQR) | 1130 |
Descriptive statistics
| Standard deviation | 921.4162178 |
|---|---|
| Coefficient of variation (CV) | 0.442150836 |
| Kurtosis | 5.633591899 |
| Mean | 2083.940915 |
| Median Absolute Deviation (MAD) | 550 |
| Skewness | 1.502606641 |
| Sum | 38444542 |
| Variance | 849007.8465 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1440 | 125 | 0.7% |
| 1300 | 119 | 0.6% |
| 1820 | 113 | 0.6% |
| 1660 | 111 | 0.6% |
| 1800 | 110 | 0.6% |
| 1320 | 108 | 0.6% |
| 1400 | 107 | 0.6% |
| 1010 | 106 | 0.6% |
| 1250 | 106 | 0.6% |
| 1480 | 106 | 0.6% |
| Other values (946) | 17337 | 94.0% |
| Value | Count | Frequency (%) |
| 290 | 1 | < 0.1% |
| 380 | 1 | < 0.1% |
| 384 | 1 | < 0.1% |
| 390 | 2 | < 0.1% |
| 410 | 1 | < 0.1% |
| 420 | 1 | < 0.1% |
| 430 | 1 | < 0.1% |
| 460 | 1 | < 0.1% |
| 480 | 1 | < 0.1% |
| 490 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13540 | 1 | < 0.1% |
| 12050 | 1 | < 0.1% |
| 10040 | 1 | < 0.1% |
| 9890 | 1 | < 0.1% |
| 9640 | 1 | < 0.1% |
| 9200 | 1 | < 0.1% |
| 8670 | 1 | < 0.1% |
| 8020 | 1 | < 0.1% |
| 8010 | 1 | < 0.1% |
| 8000 | 1 | < 0.1% |
size_lot
Real number (ℝ≥0)
| Distinct | 8807 |
|---|---|
| Distinct (%) | 47.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15036.02407 |
| Minimum | 520 |
|---|---|
| Maximum | 1651359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 520 |
|---|---|
| 5-th percentile | 1824 |
| Q1 | 5050 |
| median | 7600.5 |
| Q3 | 10625.25 |
| 95-th percentile | 42727.4 |
| Maximum | 1651359 |
| Range | 1650839 |
| Interquartile range (IQR) | 5575.25 |
Descriptive statistics
| Standard deviation | 41814.54897 |
|---|---|
| Coefficient of variation (CV) | 2.780957837 |
| Kurtosis | 299.5845778 |
| Mean | 15036.02407 |
| Median Absolute Deviation (MAD) | 2600.5 |
| Skewness | 13.39725552 |
| Sum | 277384572 |
| Variance | 1748456505 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 309 | 1.7% |
| 6000 | 249 | 1.3% |
| 4000 | 218 | 1.2% |
| 7200 | 194 | 1.1% |
| 7500 | 100 | 0.5% |
| 4800 | 99 | 0.5% |
| 9600 | 96 | 0.5% |
| 8400 | 92 | 0.5% |
| 4500 | 91 | 0.5% |
| 3600 | 87 | 0.5% |
| Other values (8797) | 16913 | 91.7% |
| Value | Count | Frequency (%) |
| 520 | 1 | < 0.1% |
| 572 | 1 | < 0.1% |
| 600 | 1 | < 0.1% |
| 609 | 1 | < 0.1% |
| 635 | 1 | < 0.1% |
| 649 | 2 | < 0.1% |
| 675 | 1 | < 0.1% |
| 676 | 1 | < 0.1% |
| 681 | 1 | < 0.1% |
| 683 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1651359 | 1 | < 0.1% |
| 1164794 | 1 | < 0.1% |
| 1074218 | 1 | < 0.1% |
| 1024068 | 1 | < 0.1% |
| 982998 | 1 | < 0.1% |
| 982278 | 1 | < 0.1% |
| 920423 | 1 | < 0.1% |
| 881654 | 1 | < 0.1% |
| 871200 | 1 | < 0.1% |
| 715690 | 1 | < 0.1% |
num_floors
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.494606461 |
| Minimum | 1 |
|---|---|
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1.5 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 3.5 |
| Range | 2.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5408059205 |
|---|---|
| Coefficient of variation (CV) | 0.3618383397 |
| Kurtosis | -0.4790163078 |
| Mean | 1.494606461 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.6188375561 |
| Sum | 27572.5 |
| Variance | 0.2924710437 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 9124 | 49.5% |
| 2 | 7030 | 38.1% |
| 1.5 | 1617 | 8.8% |
| 3 | 525 | 2.8% |
| 2.5 | 144 | 0.8% |
| 3.5 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 9124 | 49.5% |
| 1.5 | 1617 | 8.8% |
| 2 | 7030 | 38.1% |
| 2.5 | 144 | 0.8% |
| 3 | 525 | 2.8% |
| 3.5 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.5 | 8 | < 0.1% |
| 3 | 525 | 2.8% |
| 2.5 | 144 | 0.8% |
| 2 | 7030 | 38.1% |
| 1.5 | 1617 | 8.8% |
| 1 | 9124 | 49.5% |
is_waterfront
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 144.2 KiB |
| 0 | 18307 |
|---|---|
| 1 | 141 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 18448 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 18307 | 99.2% |
| 1 | 141 | 0.8% |
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 18307 | 99.2% |
| 1 | 141 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 18307 | 99.2% |
| 1 | 141 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18448 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18307 | 99.2% |
| 1 | 141 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18448 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 18307 | 99.2% |
| 1 | 141 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18448 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 18307 | 99.2% |
| 1 | 141 | 0.8% |
condition
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 144.2 KiB |
| 3 | 11941 |
|---|---|
| 4 | 4865 |
| 5 | 1466 |
| 2 | 150 |
| 1 | 26 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 18448 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 5 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 11941 | 64.7% |
| 4 | 4865 | 26.4% |
| 5 | 1466 | 7.9% |
| 2 | 150 | 0.8% |
| 1 | 26 | 0.1% |
Category Frequency Plot
| Value | Count | Frequency (%) |
| 3 | 11941 | 64.7% |
| 4 | 4865 | 26.4% |
| 5 | 1466 | 7.9% |
| 2 | 150 | 0.8% |
| 1 | 26 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 11941 | 64.7% |
| 4 | 4865 | 26.4% |
| 5 | 1466 | 7.9% |
| 2 | 150 | 0.8% |
| 1 | 26 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18448 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 11941 | 64.7% |
| 4 | 4865 | 26.4% |
| 5 | 1466 | 7.9% |
| 2 | 150 | 0.8% |
| 1 | 26 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18448 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 11941 | 64.7% |
| 4 | 4865 | 26.4% |
| 5 | 1466 | 7.9% |
| 2 | 150 | 0.8% |
| 1 | 26 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18448 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 11941 | 64.7% |
| 4 | 4865 | 26.4% |
| 5 | 1466 | 7.9% |
| 2 | 150 | 0.8% |
| 1 | 26 | 0.1% |
size_basement
Real number (ℝ≥0)
| Distinct | 283 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 293.5714983 |
| Minimum | 0 |
|---|---|
| Maximum | 4820 |
| Zeros | 11174 |
| Zeros (%) | 60.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 570 |
| 95-th percentile | 1190 |
| Maximum | 4820 |
| Range | 4820 |
| Interquartile range (IQR) | 570 |
Descriptive statistics
| Standard deviation | 443.6075028 |
|---|---|
| Coefficient of variation (CV) | 1.511071427 |
| Kurtosis | 2.755863337 |
| Mean | 293.5714983 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.570351748 |
| Sum | 5415807 |
| Variance | 196787.6165 |
| Monotonicity | Not monotonic |
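The median and the Median Absolute Deviation (MAD) are both 0 here because 60.6% of the values are zero: more than half of the observations equal the median, so more than half of the absolute deviations are zero as well. A minimal sketch with a toy sample (invented for illustration, 60% zeros) shows the effect; `mad` is a hypothetical helper name.

```python
from statistics import median


def mad(values):
    """Median absolute deviation around the median."""
    m = median(values)
    return median(abs(v - m) for v in values)


# Toy sample, not rows from the dataset: 6 of 10 values are zero.
sample = [0, 0, 0, 0, 0, 0, 300, 600, 800, 1000]

print(median(sample), mad(sample))
```

For a column like this, the IQR or the share of zeros is a more informative spread measure than the MAD.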
| Value | Count | Frequency (%) |
| 0 | 11174 | 60.6% |
| 600 | 190 | 1.0% |
| 700 | 182 | 1.0% |
| 800 | 180 | 1.0% |
| 500 | 174 | 0.9% |
| 400 | 164 | 0.9% |
| 1000 | 133 | 0.7% |
| 900 | 126 | 0.7% |
| 300 | 115 | 0.6% |
| 530 | 93 | 0.5% |
| Other values (273) | 5917 | 32.1% |
| Value | Count | Frequency (%) |
| 0 | 11174 | 60.6% |
| 10 | 2 | < 0.1% |
| 40 | 4 | < 0.1% |
| 50 | 8 | < 0.1% |
| 60 | 9 | < 0.1% |
| 65 | 1 | < 0.1% |
| 70 | 7 | < 0.1% |
| 80 | 17 | 0.1% |
| 90 | 20 | 0.1% |
| 100 | 37 | 0.2% |
| Value | Count | Frequency (%) |
| 4820 | 1 | < 0.1% |
| 4130 | 1 | < 0.1% |
| 3500 | 1 | < 0.1% |
| 3480 | 1 | < 0.1% |
| 3000 | 1 | < 0.1% |
| 2850 | 1 | < 0.1% |
| 2810 | 1 | < 0.1% |
| 2730 | 1 | < 0.1% |
| 2720 | 1 | < 0.1% |
| 2620 | 1 | < 0.1% |
year_built
Real number (ℝ≥0)
| Distinct | 116 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1971.001138 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1915 |
| Q1 | 1952 |
| median | 1975 |
| Q3 | 1997 |
| 95-th percentile | 2011 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 29.36161911 |
|---|---|
| Coefficient of variation (CV) | 0.01489680475 |
| Kurtosis | -0.6556912762 |
| Mean | 1971.001138 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | -0.4721390602 |
| Sum | 36361029 |
| Variance | 862.104677 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 468 | 2.5% |
| 2006 | 380 | 2.1% |
| 2004 | 378 | 2.0% |
| 2005 | 372 | 2.0% |
| 2003 | 367 | 2.0% |
| 1977 | 359 | 1.9% |
| 2007 | 358 | 1.9% |
| 1978 | 330 | 1.8% |
| 2008 | 319 | 1.7% |
| 1968 | 315 | 1.7% |
| Other values (106) | 14802 | 80.2% |
| Value | Count | Frequency (%) |
| 1900 | 75 | 0.4% |
| 1901 | 25 | 0.1% |
| 1902 | 20 | 0.1% |
| 1903 | 37 | 0.2% |
| 1904 | 41 | 0.2% |
| 1905 | 63 | 0.3% |
| 1906 | 77 | 0.4% |
| 1907 | 52 | 0.3% |
| 1908 | 73 | 0.4% |
| 1909 | 83 | 0.4% |
| Value | Count | Frequency (%) |
| 2015 | 34 | 0.2% |
| 2014 | 468 | 2.5% |
| 2013 | 169 | 0.9% |
| 2012 | 144 | 0.8% |
| 2011 | 109 | 0.6% |
| 2010 | 122 | 0.7% |
| 2009 | 197 | 1.1% |
| 2008 | 319 | 1.7% |
| 2007 | 358 | 1.9% |
| 2006 | 380 | 2.1% |
renovation_date
Real number (ℝ≥0)
| Distinct | 68 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 85.14500217 |
| Minimum | 0 |
|---|---|
| Maximum | 2015 |
| Zeros | 17661 |
| Zeros (%) | 95.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2015 |
| Range | 2015 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 403.3712626 |
|---|---|
| Coefficient of variation (CV) | 4.737462592 |
| Kurtosis | 18.49656836 |
| Mean | 85.14500217 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.526923714 |
| Sum | 1570755 |
| Variance | 162708.3755 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 17661 | 95.7% |
| 2014 | 77 | 0.4% |
| 2003 | 34 | 0.2% |
| 2013 | 33 | 0.2% |
| 2007 | 30 | 0.2% |
| 2000 | 29 | 0.2% |
| 2005 | 29 | 0.2% |
| 1990 | 23 | 0.1% |
| 2004 | 21 | 0.1% |
| 2006 | 21 | 0.1% |
| Other values (58) | 490 | 2.7% |
| Value | Count | Frequency (%) |
| 0 | 17661 | 95.7% |
| 1934 | 1 | < 0.1% |
| 1940 | 2 | < 0.1% |
| 1944 | 1 | < 0.1% |
| 1945 | 3 | < 0.1% |
| 1946 | 2 | < 0.1% |
| 1948 | 1 | < 0.1% |
| 1950 | 2 | < 0.1% |
| 1951 | 1 | < 0.1% |
| 1953 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2015 | 15 | 0.1% |
| 2014 | 77 | 0.4% |
| 2013 | 33 | 0.2% |
| 2012 | 10 | 0.1% |
| 2011 | 13 | 0.1% |
| 2010 | 17 | 0.1% |
| 2009 | 17 | 0.1% |
| 2008 | 16 | 0.1% |
| 2007 | 30 | 0.2% |
| 2006 | 21 | 0.1% |
zip
Real number (ℝ≥0)
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98077.92145 |
| Minimum | 98001 |
|---|---|
| Maximum | 98199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 98001 |
|---|---|
| 5-th percentile | 98004 |
| Q1 | 98033 |
| median | 98065 |
| Q3 | 98118 |
| 95-th percentile | 98177 |
| Maximum | 98199 |
| Range | 198 |
| Interquartile range (IQR) | 85 |
Descriptive statistics
| Standard deviation | 53.49744016 |
|---|---|
| Coefficient of variation (CV) | 0.0005454585433 |
| Kurtosis | -0.8627090916 |
| Mean | 98077.92145 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 0.3999911363 |
| Sum | 1809341495 |
| Variance | 2861.976104 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98103 | 512 | 2.8% |
| 98038 | 504 | 2.7% |
| 98115 | 495 | 2.7% |
| 98117 | 478 | 2.6% |
| 98034 | 477 | 2.6% |
| 98052 | 475 | 2.6% |
| 98042 | 471 | 2.6% |
| 98118 | 445 | 2.4% |
| 98023 | 429 | 2.3% |
| 98006 | 424 | 2.3% |
| Other values (60) | 13738 | 74.5% |
| Value | Count | Frequency (%) |
| 98001 | 313 | 1.7% |
| 98002 | 169 | 0.9% |
| 98003 | 241 | 1.3% |
| 98004 | 266 | 1.4% |
| 98005 | 143 | 0.8% |
| 98006 | 424 | 2.3% |
| 98007 | 127 | 0.7% |
| 98008 | 243 | 1.3% |
| 98010 | 84 | 0.5% |
| 98011 | 178 | 1.0% |
| Value | Count | Frequency (%) |
| 98199 | 269 | 1.5% |
| 98198 | 231 | 1.3% |
| 98188 | 112 | 0.6% |
| 98178 | 224 | 1.2% |
| 98177 | 220 | 1.2% |
| 98168 | 232 | 1.3% |
| 98166 | 217 | 1.2% |
| 98155 | 376 | 2.0% |
| 98148 | 48 | 0.3% |
| 98146 | 249 | 1.3% |
latitude
Real number (ℝ≥0)
| Distinct | 18324 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.5600304 |
| Minimum | 47.15593331 |
|---|---|
| Maximum | 47.77762383 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 47.15593331 |
|---|---|
| 5-th percentile | 47.30984567 |
| Q1 | 47.47152712 |
| median | 47.57159932 |
| Q3 | 47.67791844 |
| 95-th percentile | 47.74987093 |
| Maximum | 47.77762383 |
| Range | 0.62169052 |
| Interquartile range (IQR) | 0.20639132 |
Descriptive statistics
| Standard deviation | 0.1385573676 |
|---|---|
| Coefficient of variation (CV) | 0.002913315371 |
| Kurtosis | -0.6735891043 |
| Mean | 47.5600304 |
| Median Absolute Deviation (MAD) | 0.104722125 |
| Skewness | -0.4865787418 |
| Sum | 877387.4407 |
| Variance | 0.01919814412 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.50449008 | 3 | < 0.1% |
| 47.57198731 | 2 | < 0.1% |
| 47.68451447 | 2 | < 0.1% |
| 47.49598478 | 2 | < 0.1% |
| 47.73319725 | 2 | < 0.1% |
| 47.37312328 | 2 | < 0.1% |
| 47.64990089 | 2 | < 0.1% |
| 47.48395631 | 2 | < 0.1% |
| 47.49483567 | 2 | < 0.1% |
| 47.70764014 | 2 | < 0.1% |
| Other values (18314) | 18427 | 99.9% |
| Value | Count | Frequency (%) |
| 47.15593331 | 1 | < 0.1% |
| 47.15932775 | 1 | < 0.1% |
| 47.16219954 | 1 | < 0.1% |
| 47.16467409 | 1 | < 0.1% |
| 47.17749491 | 1 | < 0.1% |
| 47.17757614 | 1 | < 0.1% |
| 47.17764305 | 1 | < 0.1% |
| 47.17945001 | 1 | < 0.1% |
| 47.180318 | 1 | < 0.1% |
| 47.18081934 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 47.77762383 | 1 | < 0.1% |
| 47.77759209 | 1 | < 0.1% |
| 47.77747655 | 1 | < 0.1% |
| 47.77747455 | 1 | < 0.1% |
| 47.7774599 | 1 | < 0.1% |
| 47.77744884 | 1 | < 0.1% |
| 47.77720903 | 1 | < 0.1% |
| 47.77720312 | 1 | < 0.1% |
| 47.77713999 | 1 | < 0.1% |
| 47.77711975 | 1 | < 0.1% |
longitude
Real number (ℝ)
| Distinct | 18287 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.2144191 |
| Minimum | -122.5186481 |
|---|---|
| Maximum | -121.315254 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 18448 |
| Negative (%) | 100.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | -122.5186481 |
|---|---|
| 5-th percentile | -122.387008 |
| Q1 | -122.3280844 |
| median | -122.2306878 |
| Q3 | -122.1257329 |
| 95-th percentile | -121.9809708 |
| Maximum | -121.315254 |
| Range | 1.2033941 |
| Interquartile range (IQR) | 0.20235155 |
Descriptive statistics
| Standard deviation | 0.1399104894 |
|---|---|
| Coefficient of variation (CV) | -0.001144795274 |
| Kurtosis | 0.8173737449 |
| Mean | -122.2144191 |
| Median Absolute Deviation (MAD) | 0.10073825 |
| Skewness | 0.8505957554 |
| Sum | -2254611.603 |
| Variance | 0.01957494504 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.3301162 | 3 | < 0.1% |
| -122.3642341 | 2 | < 0.1% |
| -122.3907682 | 2 | < 0.1% |
| -122.1531596 | 2 | < 0.1% |
| -122.3790929 | 2 | < 0.1% |
| -122.3792593 | 2 | < 0.1% |
| -122.0564157 | 2 | < 0.1% |
| -122.355521 | 2 | < 0.1% |
| -122.3485889 | 2 | < 0.1% |
| -122.2689077 | 2 | < 0.1% |
| Other values (18277) | 18427 | 99.9% |
| Value | Count | Frequency (%) |
| -122.5186481 | 1 | < 0.1% |
| -122.5147975 | 1 | < 0.1% |
| -122.5139991 | 1 | < 0.1% |
| -122.5112313 | 1 | < 0.1% |
| -122.5112242 | 1 | < 0.1% |
| -122.5090803 | 1 | < 0.1% |
| -122.5089825 | 1 | < 0.1% |
| -122.506548 | 1 | < 0.1% |
| -122.5061813 | 1 | < 0.1% |
| -122.5051716 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -121.315254 | 1 | < 0.1% |
| -121.3194265 | 1 | < 0.1% |
| -121.3248771 | 1 | < 0.1% |
| -121.3590703 | 1 | < 0.1% |
| -121.3639005 | 1 | < 0.1% |
| -121.3640735 | 1 | < 0.1% |
| -121.4020895 | 1 | < 0.1% |
| -121.4028515 | 1 | < 0.1% |
| -121.4172721 | 1 | < 0.1% |
| -121.4729671 | 1 | < 0.1% |
avg_size_neighbor_houses
Real number (ℝ≥0)
High correlation
| Distinct | 734 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1988.306483 |
| Minimum | 399 |
|---|---|
| Maximum | 6110 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 399 |
|---|---|
| 5-th percentile | 1140 |
| Q1 | 1490 |
| median | 1840 |
| Q3 | 2370 |
| 95-th percentile | 3300 |
| Maximum | 6110 |
| Range | 5711 |
| Interquartile range (IQR) | 880 |
Descriptive statistics
| Standard deviation | 686.1731244 |
|---|---|
| Coefficient of variation (CV) | 0.3451043037 |
| Kurtosis | 1.541837883 |
| Mean | 1988.306483 |
| Median Absolute Deviation (MAD) | 410 |
| Skewness | 1.100529225 |
| Sum | 36680278 |
| Variance | 470833.5566 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1560 | 171 | 0.9% |
| 1540 | 169 | 0.9% |
| 1440 | 166 | 0.9% |
| 1500 | 152 | 0.8% |
| 1460 | 151 | 0.8% |
| 1480 | 145 | 0.8% |
| 1580 | 142 | 0.8% |
| 1720 | 141 | 0.8% |
| 1760 | 141 | 0.8% |
| 1680 | 139 | 0.8% |
| Other values (724) | 16931 | 91.8% |
| Value | Count | Frequency (%) |
| 399 | 1 | < 0.1% |
| 460 | 2 | < 0.1% |
| 620 | 2 | < 0.1% |
| 670 | 1 | < 0.1% |
| 690 | 2 | < 0.1% |
| 700 | 1 | < 0.1% |
| 710 | 2 | < 0.1% |
| 720 | 2 | < 0.1% |
| 740 | 7 | < 0.1% |
| 750 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 6110 | 1 | < 0.1% |
| 5790 | 5 | < 0.1% |
| 5610 | 1 | < 0.1% |
| 5600 | 1 | < 0.1% |
| 5500 | 1 | < 0.1% |
| 5380 | 1 | < 0.1% |
| 5340 | 1 | < 0.1% |
| 5220 | 1 | < 0.1% |
| 5200 | 1 | < 0.1% |
| 5170 | 1 | < 0.1% |
avg_size_neighbor_lot
Real number (ℝ≥0)
High correlation
| Distinct | 7865 |
|---|---|
| Distinct (%) | 42.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12571.59622 |
| Minimum | 651 |
|---|---|
| Maximum | 858132 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 651 |
|---|---|
| 5-th percentile | 2004.4 |
| Q1 | 5100 |
| median | 7611 |
| Q3 | 10050 |
| 95-th percentile | 36565.25 |
| Maximum | 858132 |
| Range | 857481 |
| Interquartile range (IQR) | 4950 |
Descriptive statistics
| Standard deviation | 26329.26021 |
|---|---|
| Coefficient of variation (CV) | 2.094345042 |
| Kurtosis | 130.0213224 |
| Mean | 12571.59622 |
| Median Absolute Deviation (MAD) | 2491 |
| Skewness | 9.060581717 |
| Sum | 231920807 |
| Variance | 693229943.2 |
| Monotonicity | Not monotonic |
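Several rows in the quantile and descriptive tables for avg_size_neighbor_lot are derived from one another, which gives a quick way to sanity-check the report. The sketch below recomputes them from the values printed above. The variable names are illustrative, and it assumes the Sum is taken over all 18448 observations listed in the dataset overview (the variable has no missing values).

```python
# Values copied from the avg_size_neighbor_lot tables above.
n_observations = 18448   # assumption: every observation contributes to Sum
total = 231920807        # Sum
std_dev = 26329.26021    # Standard deviation
minimum, maximum = 651, 858132
q1, q3 = 5100, 10050

mean = total / n_observations      # Mean = Sum / number of observations
cv = std_dev / mean                # Coefficient of variation = std / mean
variance = std_dev ** 2            # Variance = std squared
value_range = maximum - minimum    # Range = Maximum - Minimum
iqr = q3 - q1                      # IQR = Q3 - Q1

print(f"mean={mean:.5f} cv={cv:.6f} variance={variance:.1f}")
print(f"range={value_range} iqr={iqr}")
```

The recomputed figures should agree with the reported Mean, CV, Variance, Range, and IQR up to rounding of the printed inputs.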
| Value | Count | Frequency (%) |
| 5000 | 375 | 2.0% |
| 4000 | 304 | 1.6% |
| 6000 | 241 | 1.3% |
| 7200 | 183 | 1.0% |
| 7500 | 128 | 0.7% |
| 4800 | 117 | 0.6% |
| 3600 | 96 | 0.5% |
| 8400 | 95 | 0.5% |
| 4500 | 94 | 0.5% |
| 5100 | 92 | 0.5% |
| Other values (7855) | 16723 | 90.6% |
| Value | Count | Frequency (%) |
| 651 | 1 | < 0.1% |
| 659 | 1 | < 0.1% |
| 748 | 2 | < 0.1% |
| 750 | 4 | < 0.1% |
| 755 | 1 | < 0.1% |
| 757 | 1 | < 0.1% |
| 788 | 1 | < 0.1% |
| 809 | 1 | < 0.1% |
| 810 | 2 | < 0.1% |
| 817 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 858132 | 1 | < 0.1% |
| 560617 | 1 | < 0.1% |
| 438213 | 1 | < 0.1% |
| 434728 | 1 | < 0.1% |
| 425581 | 1 | < 0.1% |
| 411962 | 1 | < 0.1% |
| 392040 | 1 | < 0.1% |
| 386812 | 1 | < 0.1% |
| 380279 | 1 | < 0.1% |
| 360000 | 1 | < 0.1% |
schooldist
| Distinct | 3207 |
|---|---|
| Distinct (%) | 17.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.713536969 |
| Minimum | 0.03 |
|---|---|
| Maximum | 71.25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.03 |
|---|---|
| 5-th percentile | 0.29 |
| Q1 | 0.86 |
| median | 6.91 |
| Q3 | 16.9 |
| 95-th percentile | 26.5 |
| Maximum | 71.25 |
| Range | 71.22 |
| Interquartile range (IQR) | 16.04 |
Descriptive statistics
| Standard deviation | 9.495807697 |
|---|---|
| Coefficient of variation (CV) | 0.9775849649 |
| Kurtosis | 0.790883657 |
| Mean | 9.713536969 |
| Median Absolute Deviation (MAD) | 6.3 |
| Skewness | 1.002206583 |
| Sum | 179195.33 |
| Variance | 90.17036382 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 94 | 0.5% |
| 0.61 | 93 | 0.5% |
| 0.47 | 91 | 0.5% |
| 0.46 | 90 | 0.5% |
| 0.48 | 85 | 0.5% |
| 0.39 | 83 | 0.4% |
| 0.59 | 83 | 0.4% |
| 0.36 | 82 | 0.4% |
| 0.41 | 81 | 0.4% |
| 0.38 | 79 | 0.4% |
| Other values (3197) | 17587 | 95.3% |
| Value | Count | Frequency (%) |
| 0.03 | 1 | < 0.1% |
| 0.05 | 3 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.07 | 5 | < 0.1% |
| 0.08 | 15 | 0.1% |
| 0.09 | 14 | 0.1% |
| 0.1 | 21 | 0.1% |
| 0.11 | 19 | 0.1% |
| 0.12 | 27 | 0.1% |
| 0.13 | 24 | 0.1% |
| Value | Count | Frequency (%) |
| 71.25 | 1 | < 0.1% |
| 70.95 | 1 | < 0.1% |
| 70.54 | 1 | < 0.1% |
| 67.95 | 1 | < 0.1% |
| 67.59 | 1 | < 0.1% |
| 67.58 | 1 | < 0.1% |
| 64.81 | 1 | < 0.1% |
| 64.74 | 1 | < 0.1% |
| 64.06 | 1 | < 0.1% |
| 60.26 | 1 | < 0.1% |
supermarketdist
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 3684 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.91556158 |
| Minimum | 0.12 |
|---|---|
| Maximum | 77.77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.12 |
|---|---|
| 5-th percentile | 3.44 |
| Q1 | 7.72 |
| median | 14.85 |
| Q3 | 22.86 |
| 95-th percentile | 32.51 |
| Maximum | 77.77 |
| Range | 77.65 |
| Interquartile range (IQR) | 15.14 |
Descriptive statistics
| Standard deviation | 9.725700391 |
|---|---|
| Coefficient of variation (CV) | 0.6110811952 |
| Kurtosis | 0.5914897208 |
| Mean | 15.91556158 |
| Median Absolute Deviation (MAD) | 7.46 |
| Skewness | 0.7501496581 |
| Sum | 293610.28 |
| Variance | 94.5892481 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.01 | 20 | 0.1% |
| 3.86 | 18 | 0.1% |
| 3.55 | 18 | 0.1% |
| 4.2 | 18 | 0.1% |
| 4.29 | 18 | 0.1% |
| 4.46 | 17 | 0.1% |
| 18.27 | 17 | 0.1% |
| 17.07 | 17 | 0.1% |
| 4.79 | 17 | 0.1% |
| 4.05 | 16 | 0.1% |
| Other values (3674) | 18272 | 99.0% |
| Value | Count | Frequency (%) |
| 0.12 | 1 | < 0.1% |
| 0.28 | 1 | < 0.1% |
| 0.29 | 1 | < 0.1% |
| 0.47 | 1 | < 0.1% |
| 0.52 | 1 | < 0.1% |
| 0.56 | 2 | < 0.1% |
| 0.57 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| 0.78 | 1 | < 0.1% |
| 0.79 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 77.77 | 1 | < 0.1% |
| 77.51 | 1 | < 0.1% |
| 77.11 | 1 | < 0.1% |
| 74.48 | 1 | < 0.1% |
| 74.16 | 1 | < 0.1% |
| 74.12 | 1 | < 0.1% |
| 71.69 | 1 | < 0.1% |
| 71.57 | 1 | < 0.1% |
| 69.25 | 1 | < 0.1% |
| 67.4 | 1 | < 0.1% |
warehousedist
| Distinct | 3388 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.58868441 |
| Minimum | 0.05 |
|---|---|
| Maximum | 73.43 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.05 |
|---|---|
| 5-th percentile | 1.13 |
| Q1 | 3.21 |
| median | 9.035 |
| Q3 | 19.18 |
| 95-th percentile | 28.25 |
| Maximum | 73.43 |
| Range | 73.38 |
| Interquartile range (IQR) | 15.97 |
Descriptive statistics
| Standard deviation | 9.660199668 |
|---|---|
| Coefficient of variation (CV) | 0.8335889844 |
| Kurtosis | 0.6795075975 |
| Mean | 11.58868441 |
| Median Absolute Deviation (MAD) | 6.525 |
| Skewness | 0.9501622572 |
| Sum | 213788.05 |
| Variance | 93.31945762 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.58 | 40 | 0.2% |
| 2.61 | 34 | 0.2% |
| 2.5 | 30 | 0.2% |
| 2.17 | 29 | 0.2% |
| 2.62 | 29 | 0.2% |
| 1.36 | 28 | 0.2% |
| 2.2 | 28 | 0.2% |
| 1.61 | 27 | 0.1% |
| 3.41 | 26 | 0.1% |
| 1.59 | 26 | 0.1% |
| Other values (3378) | 18151 | 98.4% |
| Value | Count | Frequency (%) |
| 0.05 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.14 | 2 | < 0.1% |
| 0.15 | 1 | < 0.1% |
| 0.16 | 2 | < 0.1% |
| 0.18 | 1 | < 0.1% |
| 0.19 | 3 | < 0.1% |
| 0.21 | 2 | < 0.1% |
| 0.22 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 73.43 | 1 | < 0.1% |
| 73.12 | 1 | < 0.1% |
| 72.71 | 1 | < 0.1% |
| 70.16 | 1 | < 0.1% |
| 69.8 | 1 | < 0.1% |
| 69.78 | 1 | < 0.1% |
| 66.9 | 1 | < 0.1% |
| 66.85 | 1 | < 0.1% |
| 66.14 | 1 | < 0.1% |
| 62.5 | 1 | < 0.1% |
churchdist
| Distinct | 3483 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.38343235 |
| Minimum | 0.02 |
|---|---|
| Maximum | 72.83 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.02 |
|---|---|
| 5-th percentile | 0.42 |
| Q1 | 1.65 |
| median | 8.6 |
| Q3 | 19.1525 |
| 95-th percentile | 29.4765 |
| Maximum | 72.83 |
| Range | 72.81 |
| Interquartile range (IQR) | 17.5025 |
Descriptive statistics
| Standard deviation | 10.37529877 |
|---|---|
| Coefficient of variation (CV) | 0.9114385231 |
| Kurtosis | 0.273512567 |
| Mean | 11.38343235 |
| Median Absolute Deviation (MAD) | 7.49 |
| Skewness | 0.8714400901 |
| Sum | 210001.56 |
| Variance | 107.6468246 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.74 | 53 | 0.3% |
| 0.41 | 53 | 0.3% |
| 0.5 | 52 | 0.3% |
| 0.4 | 52 | 0.3% |
| 0.47 | 50 | 0.3% |
| 0.89 | 48 | 0.3% |
| 0.62 | 46 | 0.2% |
| 0.63 | 45 | 0.2% |
| 0.56 | 45 | 0.2% |
| 0.69 | 45 | 0.2% |
| Other values (3473) | 17959 | 97.3% |
| Value | Count | Frequency (%) |
| 0.02 | 1 | < 0.1% |
| 0.03 | 1 | < 0.1% |
| 0.04 | 5 | < 0.1% |
| 0.05 | 8 | < 0.1% |
| 0.06 | 4 | < 0.1% |
| 0.07 | 10 | 0.1% |
| 0.08 | 17 | 0.1% |
| 0.09 | 13 | 0.1% |
| 0.1 | 9 | < 0.1% |
| 0.11 | 16 | 0.1% |
| Value | Count | Frequency (%) |
| 72.83 | 1 | < 0.1% |
| 72.51 | 1 | < 0.1% |
| 72.11 | 1 | < 0.1% |
| 69.55 | 1 | < 0.1% |
| 69.19 | 1 | < 0.1% |
| 69.17 | 1 | < 0.1% |
| 66.31 | 1 | < 0.1% |
| 66.25 | 1 | < 0.1% |
| 65.38 | 1 | < 0.1% |
| 63.76 | 1 | < 0.1% |
collegedist
| Distinct | 4218 |
|---|---|
| Distinct (%) | 22.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.76923786 |
| Minimum | 0.24 |
|---|---|
| Maximum | 76.31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.24 |
|---|---|
| 5-th percentile | 2.17 |
| Q1 | 6.92 |
| median | 14.26 |
| Q3 | 24.77 |
| 95-th percentile | 37.2165 |
| Maximum | 76.31 |
| Range | 76.07 |
| Interquartile range (IQR) | 17.85 |
Descriptive statistics
| Standard deviation | 11.88112174 |
|---|---|
| Coefficient of variation (CV) | 0.7085069605 |
| Kurtosis | -0.1469904367 |
| Mean | 16.76923786 |
| Median Absolute Deviation (MAD) | 8.67 |
| Skewness | 0.7183724368 |
| Sum | 309358.9 |
| Variance | 141.1610539 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.53 | 19 | 0.1% |
| 4.26 | 18 | 0.1% |
| 4.33 | 18 | 0.1% |
| 3.99 | 17 | 0.1% |
| 4.25 | 17 | 0.1% |
| 4.01 | 17 | 0.1% |
| 2.87 | 16 | 0.1% |
| 2.97 | 16 | 0.1% |
| 3.35 | 16 | 0.1% |
| 4.06 | 16 | 0.1% |
| Other values (4208) | 18278 | 99.1% |
| Value | Count | Frequency (%) |
| 0.24 | 2 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| 0.31 | 1 | < 0.1% |
| 0.33 | 1 | < 0.1% |
| 0.34 | 1 | < 0.1% |
| 0.35 | 1 | < 0.1% |
| 0.38 | 1 | < 0.1% |
| 0.4 | 1 | < 0.1% |
| 0.41 | 1 | < 0.1% |
| 0.45 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 76.31 | 1 | < 0.1% |
| 76.02 | 1 | < 0.1% |
| 75.61 | 1 | < 0.1% |
| 72.99 | 1 | < 0.1% |
| 72.65 | 1 | < 0.1% |
| 72.63 | 1 | < 0.1% |
| 72.29 | 1 | < 0.1% |
| 71.04 | 1 | < 0.1% |
| 69.88 | 1 | < 0.1% |
| 69.82 | 1 | < 0.1% |
hospitaldist
| Distinct | 3749 |
|---|---|
| Distinct (%) | 20.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.47892238 |
| Minimum | 0.07 |
|---|---|
| Maximum | 72.73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.07 |
|---|---|
| 5-th percentile | 1.25 |
| Q1 | 4.14 |
| median | 10.68 |
| Q3 | 20.88 |
| 95-th percentile | 32.33 |
| Maximum | 72.73 |
| Range | 72.66 |
| Interquartile range (IQR) | 16.74 |
Descriptive statistics
| Standard deviation | 10.74770795 |
|---|---|
| Coefficient of variation (CV) | 0.7973714552 |
| Kurtosis | 0.222142814 |
| Mean | 13.47892238 |
| Median Absolute Deviation (MAD) | 7.76 |
| Skewness | 0.8550592604 |
| Sum | 248659.16 |
| Variance | 115.5132262 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.61 | 31 | 0.2% |
| 2.62 | 28 | 0.2% |
| 2.35 | 26 | 0.1% |
| 1.85 | 25 | 0.1% |
| 1.93 | 25 | 0.1% |
| 2.65 | 25 | 0.1% |
| 1.34 | 24 | 0.1% |
| 1.88 | 23 | 0.1% |
| 2.22 | 23 | 0.1% |
| 1.71 | 23 | 0.1% |
| Other values (3739) | 18195 | 98.6% |
| Value | Count | Frequency (%) |
| 0.07 | 1 | < 0.1% |
| 0.08 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.11 | 1 | < 0.1% |
| 0.13 | 2 | < 0.1% |
| 0.14 | 2 | < 0.1% |
| 0.15 | 2 | < 0.1% |
| 0.17 | 3 | < 0.1% |
| 0.18 | 2 | < 0.1% |
| 0.19 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 72.73 | 1 | < 0.1% |
| 72.43 | 1 | < 0.1% |
| 72.02 | 1 | < 0.1% |
| 69.42 | 1 | < 0.1% |
| 69.06 | 2 | < 0.1% |
| 68.53 | 1 | < 0.1% |
| 67.39 | 1 | < 0.1% |
| 66.31 | 1 | < 0.1% |
| 66.24 | 1 | < 0.1% |
| 61.61 | 1 | < 0.1% |
train_stationdist
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 3867 |
|---|---|
| Distinct (%) | 21.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.57107925 |
| Minimum | 0.1 |
|---|---|
| Maximum | 74.54 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 1.66 |
| Q1 | 5.25 |
| median | 11.74 |
| Q3 | 22.07 |
| 95-th percentile | 33.54 |
| Maximum | 74.54 |
| Range | 74.44 |
| Interquartile range (IQR) | 16.82 |
Descriptive statistics
| Standard deviation | 11.00772986 |
|---|---|
| Coefficient of variation (CV) | 0.7554505519 |
| Kurtosis | 0.1070939973 |
| Mean | 14.57107925 |
| Median Absolute Deviation (MAD) | 7.79 |
| Skewness | 0.8144706752 |
| Sum | 268807.27 |
| Variance | 121.1701167 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.25 | 26 | 0.1% |
| 2.37 | 20 | 0.1% |
| 2.88 | 20 | 0.1% |
| 2.33 | 20 | 0.1% |
| 3.4 | 20 | 0.1% |
| 2.01 | 19 | 0.1% |
| 2.12 | 19 | 0.1% |
| 2.84 | 19 | 0.1% |
| 1.79 | 18 | 0.1% |
| 3.1 | 18 | 0.1% |
| Other values (3857) | 18249 | 98.9% |
| Value | Count | Frequency (%) |
| 0.1 | 1 | < 0.1% |
| 0.15 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.24 | 3 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.26 | 4 | < 0.1% |
| 0.27 | 2 | < 0.1% |
| 0.28 | 2 | < 0.1% |
| 0.29 | 2 | < 0.1% |
| 0.32 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 74.54 | 1 | < 0.1% |
| 74.25 | 1 | < 0.1% |
| 73.84 | 1 | < 0.1% |
| 71.23 | 1 | < 0.1% |
| 70.87 | 2 | < 0.1% |
| 68.9 | 1 | < 0.1% |
| 68.87 | 1 | < 0.1% |
| 68.15 | 1 | < 0.1% |
| 68.07 | 1 | < 0.1% |
| 63.54 | 1 | < 0.1% |
universitydist
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 4013 |
|---|---|
| Distinct (%) | 21.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.22622615 |
| Minimum | 0.12 |
|---|---|
| Maximum | 73.53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.12 |
|---|---|
| 5-th percentile | 1.43 |
| Q1 | 5.9 |
| median | 12.53 |
| Q3 | 22.58 |
| 95-th percentile | 35.0365 |
| Maximum | 73.53 |
| Range | 73.41 |
| Interquartile range (IQR) | 16.68 |
Descriptive statistics
| Standard deviation | 11.2432905 |
|---|---|
| Coefficient of variation (CV) | 0.7384160983 |
| Kurtosis | 0.09924153784 |
| Mean | 15.22622615 |
| Median Absolute Deviation (MAD) | 7.83 |
| Skewness | 0.8019840244 |
| Sum | 280893.42 |
| Variance | 126.4115814 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.55 | 22 | 0.1% |
| 3.23 | 20 | 0.1% |
| 1.68 | 20 | 0.1% |
| 1.35 | 20 | 0.1% |
| 1.5 | 20 | 0.1% |
| 3.95 | 20 | 0.1% |
| 1.37 | 19 | 0.1% |
| 4.69 | 18 | 0.1% |
| 2.99 | 17 | 0.1% |
| 1.4 | 17 | 0.1% |
| Other values (4003) | 18255 | 99.0% |
| Value | Count | Frequency (%) |
| 0.12 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.17 | 3 | < 0.1% |
| 0.18 | 2 | < 0.1% |
| 0.19 | 3 | < 0.1% |
| 0.2 | 2 | < 0.1% |
| 0.21 | 2 | < 0.1% |
| 0.22 | 2 | < 0.1% |
| 0.23 | 2 | < 0.1% |
| 0.25 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 73.53 | 1 | < 0.1% |
| 73.25 | 1 | < 0.1% |
| 72.84 | 1 | < 0.1% |
| 71.05 | 1 | < 0.1% |
| 70.22 | 1 | < 0.1% |
| 69.87 | 1 | < 0.1% |
| 69.86 | 1 | < 0.1% |
| 68.83 | 1 | < 0.1% |
| 67.21 | 1 | < 0.1% |
| 67.12 | 1 | < 0.1% |
hangardist
| Distinct | 3572 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.37227179 |
| Minimum | 0.08 |
|---|---|
| Maximum | 71.04 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 144.2 KiB |
Quantile statistics
| Minimum | 0.08 |
|---|---|
| 5-th percentile | 2.62 |
| Q1 | 6.15 |
| median | 10.21 |
| Q3 | 19.3225 |
| 95-th percentile | 30.7665 |
| Maximum | 71.04 |
| Range | 70.96 |
| Interquartile range (IQR) | 13.1725 |
Descriptive statistics
| Standard deviation | 9.466188402 |
|---|---|
| Coefficient of variation (CV) | 0.7078967994 |
| Kurtosis | 1.061115416 |
| Mean | 13.37227179 |
| Median Absolute Deviation (MAD) | 5.25 |
| Skewness | 1.105302022 |
| Sum | 246691.67 |
| Variance | 89.60872286 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.18 | 25 | 0.1% |
| 5.81 | 24 | 0.1% |
| 5.41 | 23 | 0.1% |
| 5.35 | 23 | 0.1% |
| 5.82 | 23 | 0.1% |
| 7.46 | 22 | 0.1% |
| 7.21 | 22 | 0.1% |
| 6.29 | 22 | 0.1% |
| 5.69 | 22 | 0.1% |
| 6.04 | 22 | 0.1% |
| Other values (3562) | 18220 | 98.8% |
| Value | Count | Frequency (%) |
| 0.08 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.14 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.21 | 1 | < 0.1% |
| 0.24 | 2 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.31 | 1 | < 0.1% |
| 0.32 | 2 | < 0.1% |
| 0.35 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 71.04 | 1 | < 0.1% |
| 70.74 | 1 | < 0.1% |
| 70.33 | 1 | < 0.1% |
| 67.74 | 1 | < 0.1% |
| 67.7 | 1 | < 0.1% |
| 67.38 | 1 | < 0.1% |
| 67.37 | 1 | < 0.1% |
| 66.03 | 1 | < 0.1% |
| 64.57 | 1 | < 0.1% |
| 64.51 | 1 | < 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
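As a rough illustration of that recipe, the sketch below ranks two small made-up samples (ties receive the average of their positions) and then applies the ordinary covariance-over-standard-deviations formula to the ranks. The helper names `average_ranks`, `pearson`, and `spearman` are hypothetical and are not taken from the profiling tool.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson(x, y):
    """Covariance divided by the product of standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but clearly nonlinear

print("spearman:", spearman(x, y))
print("pearson: ", pearson(x, y))
```

Because `y` increases whenever `x` does, the rank-based coefficient is 1 up to floating-point rounding, while the linear coefficient on the raw values stays below 1.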
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
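The definition and the invariance property can be checked on a handful of values. The sketch below uses size_house and price from the first five rows of the dataset purely as sample input. The function name `pearson_r` and the shift and scale constants are arbitrary choices for illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Sample covariance of x and y divided by the product of their sample standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)


# size_house and price of the first five rows shown in the report.
size_house = [1180, 2570, 770, 1960, 1680]
price = [221900, 538000, 180000, 604000, 510000]

r = pearson_r(size_house, price)

# Shift and rescale each variable separately (positive scale factors).
shifted_size = [0.5 * s + 100 for s in size_house]
price_thousands = [p / 1000 for p in price]
r_rescaled = pearson_r(shifted_size, price_thousands)

# A negative scale factor keeps the magnitude but flips the sign.
r_flipped = pearson_r([-s for s in size_house], price)

print(r, r_rescaled, r_flipped)
```

Five rows are far too few to say anything about the full dataset; the point is only that `r` and `r_rescaled` agree up to floating-point error.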
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
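The pair-counting description translates almost directly into code. The following is a minimal sketch of that formula exactly as stated (often called tau-a), using made-up values and a hypothetical `kendall_tau` helper. Library implementations commonly apply a tie correction (tau-b), so their results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau(x, y):
    """(concordant - discordant) / total pairs, with tied pairs counted as neither."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
    return (concordant - discordant) / len(pairs)


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair out of six

print(kendall_tau(x, y))           # 5 concordant, 1 discordant -> 4/6
print(kendall_tau(x, x[::-1]))     # fully reversed order -> -1.0
```

The quadratic loop over all pairs is fine for illustration but would be slow on the 18448 rows of this dataset.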
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available separately.
First rows
| | price | num_bed | num_bath | size_house | size_lot | num_floors | is_waterfront | condition | size_basement | year_built | renovation_date | zip | latitude | longitude | avg_size_neighbor_houses | avg_size_neighbor_lot | schooldist | supermarketdist | warehousedist | churchdist | collegedist | hospitaldist | train_stationdist | universitydist | hangardist |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 221900 | 3 | 1.00 | 1180 | 5650 | 1.0 | 0 | 3 | 0 | 1955 | 0 | 98178 | 47.511234 | -122.256775 | 1340 | 5650 | 0.23 | 6.27 | 2.32 | 3.55 | 12.70 | 6.87 | 8.62 | 10.55 | 5.28 |
| 1 | 538000 | 3 | 2.25 | 2570 | 7242 | 2.0 | 0 | 3 | 400 | 1951 | 1991 | 98125 | 47.721023 | -122.318862 | 1690 | 7639 | 0.91 | 12.04 | 1.46 | 0.21 | 2.58 | 1.80 | 2.08 | 6.51 | 5.64 |
| 2 | 180000 | 2 | 1.00 | 770 | 10000 | 1.0 | 0 | 3 | 0 | 1933 | 0 | 98028 | 47.737927 | -122.233196 | 2720 | 8062 | 4.26 | 16.10 | 4.82 | 4.58 | 8.60 | 7.95 | 8.10 | 9.42 | 6.11 |
| 3 | 604000 | 4 | 3.00 | 1960 | 5000 | 1.0 | 0 | 5 | 910 | 1965 | 0 | 98136 | 47.520820 | -122.393185 | 1360 | 5000 | 1.23 | 6.16 | 4.69 | 2.03 | 11.95 | 7.80 | 8.93 | 8.47 | 6.04 |
| 4 | 510000 | 3 | 2.00 | 1680 | 8080 | 1.0 | 0 | 3 | 0 | 1987 | 0 | 98074 | 47.616812 | -122.044901 | 1800 | 7503 | 17.55 | 22.18 | 20.45 | 18.70 | 20.80 | 18.47 | 19.61 | 18.18 | 17.88 |
| 5 | 1225000 | 4 | 4.50 | 5420 | 101930 | 1.0 | 0 | 3 | 1530 | 2001 | 0 | 98053 | 47.656118 | -122.005287 | 4760 | 101930 | 19.44 | 25.92 | 22.92 | 21.51 | 24.16 | 20.74 | 22.44 | 21.37 | 19.45 |
| 6 | 257500 | 3 | 2.25 | 1715 | 6819 | 2.0 | 0 | 3 | 0 | 1995 | 0 | 98003 | 47.309720 | -122.327049 | 2238 | 6819 | 21.59 | 26.40 | 22.55 | 25.86 | 34.14 | 28.10 | 29.99 | 30.93 | 25.49 |
| 7 | 229500 | 3 | 1.00 | 1780 | 7470 | 1.0 | 0 | 3 | 730 | 1960 | 0 | 98146 | 47.512294 | -122.336595 | 1780 | 8113 | 1.74 | 4.04 | 3.28 | 4.84 | 11.67 | 5.92 | 7.68 | 8.40 | 3.29 |
| 8 | 323000 | 3 | 2.50 | 1890 | 6560 | 2.0 | 0 | 3 | 0 | 2003 | 0 | 98038 | 47.368407 | -122.030818 | 2390 | 7570 | 22.77 | 29.56 | 25.01 | 26.42 | 35.25 | 30.03 | 31.59 | 33.71 | 28.57 |
| 9 | 662500 | 3 | 2.50 | 3560 | 9796 | 1.0 | 0 | 3 | 1700 | 1965 | 0 | 98007 | 47.600660 | -122.145296 | 2210 | 8925 | 10.79 | 14.51 | 12.80 | 11.11 | 13.37 | 12.30 | 11.89 | 10.98 | 11.64 |
Last rows
| | price | num_bed | num_bath | size_house | size_lot | num_floors | is_waterfront | condition | size_basement | year_built | renovation_date | zip | latitude | longitude | avg_size_neighbor_houses | avg_size_neighbor_lot | schooldist | supermarketdist | warehousedist | churchdist | collegedist | hospitaldist | train_stationdist | universitydist | hangardist |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 18438 | 224000 | 3 | 1.75 | 1500 | 11968 | 1.0 | 0 | 3 | 0 | 2014 | 0 | 98010 | 47.309481 | -122.002146 | 1320 | 11303 | 28.85 | 35.74 | 31.06 | 32.81 | 41.80 | 36.37 | 38.01 | 40.06 | 34.77 |
| 18439 | 507250 | 3 | 2.50 | 2270 | 5536 | 2.0 | 0 | 3 | 0 | 2003 | 0 | 98065 | 47.538886 | -121.881214 | 2270 | 5731 | 28.56 | 33.13 | 30.76 | 29.29 | 34.24 | 32.15 | 32.23 | 31.99 | 31.21 |
| 18440 | 429000 | 3 | 2.00 | 1490 | 1126 | 3.0 | 0 | 3 | 0 | 2014 | 0 | 98144 | 47.569929 | -122.288021 | 1400 | 1230 | 0.43 | 3.55 | 3.01 | 1.17 | 5.78 | 1.61 | 2.07 | 4.36 | 0.40 |
| 18441 | 610685 | 4 | 2.50 | 2520 | 6023 | 2.0 | 0 | 3 | 0 | 2014 | 0 | 98056 | 47.513674 | -122.167422 | 2520 | 6023 | 6.89 | 12.15 | 9.05 | 8.35 | 16.30 | 11.85 | 13.09 | 15.35 | 11.17 |
| 18442 | 1007500 | 4 | 3.50 | 3510 | 7200 | 2.0 | 0 | 3 | 910 | 2009 | 0 | 98136 | 47.553718 | -122.398209 | 2050 | 6200 | 1.35 | 5.85 | 4.64 | 1.50 | 9.08 | 6.74 | 7.10 | 5.82 | 6.29 |
| 18443 | 360000 | 3 | 2.50 | 1530 | 1131 | 3.0 | 0 | 3 | 0 | 2009 | 0 | 98103 | 47.699285 | -122.346105 | 1530 | 1509 | 1.43 | 9.46 | 2.26 | 0.12 | 0.91 | 1.37 | 1.41 | 4.89 | 6.37 |
| 18444 | 400000 | 4 | 2.50 | 2310 | 5813 | 2.0 | 0 | 3 | 0 | 2014 | 0 | 98146 | 47.510733 | -122.361867 | 1830 | 7200 | 1.04 | 5.08 | 3.85 | 3.66 | 12.17 | 6.96 | 8.50 | 8.73 | 4.58 |
| 18445 | 402101 | 2 | 0.75 | 1020 | 1350 | 2.0 | 0 | 3 | 0 | 2009 | 0 | 98144 | 47.594358 | -122.298654 | 1020 | 2007 | 0.56 | 4.09 | 1.33 | 1.66 | 3.03 | 1.63 | 0.54 | 2.14 | 2.73 |
| 18446 | 400000 | 3 | 2.50 | 1600 | 2388 | 2.0 | 0 | 3 | 0 | 2004 | 0 | 98027 | 47.534499 | -122.069087 | 1410 | 1287 | 14.46 | 19.02 | 16.66 | 15.17 | 21.08 | 18.18 | 18.64 | 19.28 | 17.34 |
| 18447 | 325000 | 2 | 0.75 | 1020 | 1076 | 2.0 | 0 | 3 | 0 | 2008 | 0 | 98144 | 47.594059 | -122.298635 | 1020 | 1357 | 0.59 | 4.11 | 1.34 | 1.70 | 3.05 | 1.66 | 0.52 | 2.16 | 2.70 |
Most frequently occurring
| | price | num_bed | num_bath | size_house | size_lot | num_floors | is_waterfront | condition | size_basement | year_built | renovation_date | zip | latitude | longitude | avg_size_neighbor_houses | avg_size_neighbor_lot | schooldist | supermarketdist | warehousedist | churchdist | collegedist | hospitaldist | train_stationdist | universitydist | hangardist | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 550000 | 4 | 1.75 | 2410 | 8447 | 2.0 | 0 | 4 | 350 | 1936 | 1980 | 98074 | 47.649901 | -122.088260 | 2520 | 14789 | 13.37 | 19.66 | 17.31 | 15.45 | 17.91 | 14.56 | 16.20 | 15.11 | 13.52 | 2 |
| 1 | 555000 | 3 | 2.50 | 1940 | 3211 | 2.0 | 0 | 3 | 0 | 2009 | 0 | 98027 | 47.564378 | -122.093255 | 1880 | 3078 | 13.36 | 17.26 | 15.80 | 13.56 | 18.13 | 16.07 | 16.02 | 16.08 | 15.07 | 2 |
| 2 | 585000 | 3 | 2.50 | 2290 | 5089 | 2.0 | 0 | 3 | 0 | 2001 | 0 | 98006 | 47.544285 | -122.171537 | 2290 | 7984 | 7.31 | 11.26 | 9.50 | 7.44 | 13.87 | 10.40 | 11.12 | 12.57 | 9.60 | 2 |
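A table like the one above can be reproduced by counting identical records. The sketch below is one possible way to do that with the standard library, not the profiling tool's own method. It keeps only three fields (price, size_house, zip) from rows shown in this report to stay short, whereas the real comparison covers all 25 columns.

```python
from collections import Counter

# Illustrative (price, size_house, zip) records; truncating to three
# columns is an assumption made only to keep the example readable.
rows = [
    (550000, 2410, 98074),
    (221900, 1180, 98178),
    (555000, 1940, 98027),
    (550000, 2410, 98074),
    (555000, 1940, 98027),
    (585000, 2290, 98006),
]

# Count how often each complete record occurs and keep the repeated ones.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in sorted(duplicates.items(), key=lambda item: -item[1]):
    print(row, n)
```

Records must match on every compared field to count as duplicates, which is why a full-width comparison on the real data finds only three such rows.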